Convex Optimization for Parameter Synthesis in MDPs
نویسندگان
چکیده
Probabilistic model-checking aims to prove whether a Markov decision process (MDP) satisfies temporal logic specification. The underlying methods rely on an often unrealistic assumption that the MDP is precisely known. Consequently, parametric MDPs (pMDPs) extend with transition probabilities are functions over unspecified parameters. parameter synthesis problem compute instantiation of these parameters such resulting We formulate as quadratically constrained quadratic program, which nonconvex and NP-hard solve in general. develop two approaches iteratively obtain locally optimal solutions. first approach exploits so-called convex–concave procedure (CCP), second utilizes sequential convex programming (SCP) method. techniques improve runtime scalability by multiple orders magnitude compared black-box CCP SCP merging ideas from optimization probabilistic model-checking. demonstrate satellite collision avoidance hundreds thousands states tens their wide range commonly used benchmarks.
منابع مشابه
Algorithms for CVaR Optimization in MDPs
In many sequential decision-making problems we may want to manage risk by minimizing some measure of variability in costs in addition to minimizing a standard criterion. Conditional value-at-risk (CVaR) is a relatively new risk measure that addresses some of the shortcomings of the well-known variance-related risk measures, and because of its computational efficiencies has gained popularity in ...
متن کاملAntenna array pattern synthesis via convex optimization
We show that a variety of antenna array pattern synthesis problems can be expressed as convex optimization problems, which can be (numerically) solved with great eeciency by recently developed interior-point methods. The synthesis problems involve arrays with arbitrary geometry and element directivity, constraints on far and near eld patterns over narrow or broad frequency bandwidth, and some i...
متن کاملSequential Convex Programming for the Efficient Verification of Parametric MDPs
Multi-objective verification problems of parametric Markov decision processes under optimality criteria can be naturally expressed as nonlinear programs. We observe that many of these computationally demanding problems belong to the subclass of signomial programs. This insight allows for a sequential optimization algorithm to efficiently compute sound but possibly suboptimal solutions. Each sta...
متن کاملBudget Optimization for Sponsored Search: Censored Learning in MDPs
We consider the budget optimization problem faced by an advertiser participating in repeated sponsored search auctions, seeking to maximize the number of clicks attained under that budget. We cast the budget optimization problem as a Markov Decision Process (MDP) with censored observations, and propose a learning algorithm based on the wellknown Kaplan-Meier or product-limit estimator. We valid...
متن کاملNatasha: Faster Non-Convex Stochastic Optimization via Strongly Non-Convex Parameter
Given a nonconvex function f(x) that is an average of n smooth functions, we design stochastic first-order methods to find its approximate stationary points. The performance of our new methods depend on the smallest (negative) eigenvalue −σ of the Hessian. This parameter σ captures how strongly nonconvex f(x) is, and is analogous to the strong convexity parameter for convex optimization. At lea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Automatic Control
سال: 2022
ISSN: ['0018-9286', '1558-2523', '2334-3303']
DOI: https://doi.org/10.1109/tac.2021.3133265